Goto

Collaborating Authors

 functional role


QML-HCS: A Hypercausal Quantum Machine Learning Framework for Non-Stationary Environments

arXiv.org Artificial Intelligence

QML-HCS is a research-grade framework for constructing and analyzing quantum-inspired machine learning models operating under hypercausal feedback dynamics. Hypercausal refers to AI systems that leverage extended, deep, or nonlinear causal relationships (expanded causality) to reason, predict, and infer states beyond the capabilities of traditional causal models. Current machine learning and quantum-inspired systems struggle in non-stationary environments, where data distributions drift and models lack mechanisms for continuous adaptation, causal stability, and coherent state updating. QML-HCS addresses this limitation through a unified computational architecture that integrates quantum-inspired superposition principles, dynamic causal feedback, and deterministic-stochastic hybrid execution to enable adaptive behavior in changing environments. The framework implements a hypercausal processing core capable of reversible transformations, multipath causal propagation, and evaluation of alternative states under drift. Its architecture incorporates continuous feedback to preserve causal consistency and adjust model behavior without requiring full retraining. QML-HCS provides a reproducible and extensible Python interface backed by efficient computational routines, enabling experimentation in quantum-inspired learning, causal reasoning, and hybrid computation without the need for specialized hardware. A minimal simulation demonstrates how a hypercausal model adapts to a sudden shift in the input distribution while preserving internal coherence. This initial release establishes the foundational architecture for future theoretical extensions, benchmarking studies, and integration with classical and quantum simulation platforms.


A Case for AI Consciousness: Language Agents and Global Workspace Theory

arXiv.org Artificial Intelligence

It is generally assumed that existing artificial systems are not phenomenally conscious, and that the construction of phenomenally conscious artificial systems would require significant technological progress if it is possible at all. We challenge this assumption by arguing that if Global Workspace Theory (GWT) - a leading scientific theory of phenomenal consciousness - is correct, then instances of one widely implemented AI architecture, the artificial language agent, might easily be made phenomenally conscious if they are not already. Along the way, we articulate an explicit methodology for thinking about how to apply scientific theories of consciousness to artificial systems and employ this methodology to arrive at a set of necessary and sufficient conditions for phenomenal consciousness according to GWT.


Understanding the Functional Roles of Modelling Components in Spiking Neural Networks

arXiv.org Artificial Intelligence

Spiking neural networks (SNNs), inspired by the neural circuits of the brain, are promising in achieving high computational efficiency with biological fidelity. Nevertheless, it is quite difficult to optimize SNNs because the functional roles of their modelling components remain unclear. By designing and evaluating several variants of the classic model, we systematically investigate the functional roles of key modelling components, leakage, reset, and recurrence, in leaky integrate-and-fire (LIF) based SNNs. Through extensive experiments, we demonstrate how these components influence the accuracy, generalization, and robustness of SNNs. Specifically, we find that the leakage plays a crucial role in balancing memory retention and robustness, the reset mechanism is essential for uninterrupted temporal processing and computational efficiency, and the recurrence enriches the capability to model complex dynamics at a cost of robustness degradation. With these interesting observations, we provide optimization suggestions for enhancing the performance of SNNs in different scenarios. This work deepens the understanding of how SNNs work, which offers valuable guidance for the development of more effective and robust neuromorphic models.


Artificial Neural Nets and the Representation of Human Concepts

arXiv.org Artificial Intelligence

What do artificial neural networks (ANNs) learn? The machine learning (ML) community shares the narrative that ANNs must develop abstract human concepts to perform complex tasks. Some go even further and believe that these concepts are stored in individual units of the network. Based on current research, I systematically investigate the assumptions underlying this narrative. I conclude that ANNs are indeed capable of performing complex prediction tasks, and that they may learn human and non-human concepts to do so. However, evidence indicates that ANNs do not represent these concepts in individual units.


Taking the Intentional Stance Seriously, or "Intending" to Improve Cognitive Systems

arXiv.org Artificial Intelligence

Finding claims that researchers have made considerable progress in artificial intelligence over the last several decades is easy. However, our everyday interactions with cognitive systems (e.g., Siri, Alexa, DALL-E) quickly move from intriguing to frustrating. One cause of those frustrations rests in a mismatch between the expectations we have due to our inherent, folk-psychological theories and the real limitations we experience with existing computer programs. The software does not understand that people have goals, beliefs about how to achieve those goals, and intentions to act accordingly. One way to align cognitive systems with our expectations is to imbue them with mental states that mirror those we use to predict and explain human behavior. This paper discusses these concerns and illustrates the challenge of following this route by analyzing the mental state 'intention.' That analysis is joined with high-level methodological suggestions that support progress in this endeavor.


Enabling Integration and Interaction for Decentralized Artificial Intelligence in Airline Disruption Management

arXiv.org Artificial Intelligence

Airline disruption management traditionally seeks to address three problem dimensions: aircraft scheduling, crew scheduling, and passenger scheduling, in that order. However, current efforts have, at most, only addressed the first two problem dimensions concurrently and do not account for the propagative effects that uncertain scheduling outcomes in one dimension can have on another dimension. In addition, existing approaches for airline disruption management include human specialists who decide on necessary corrective actions for airline schedule disruptions on the day of operation. However, human specialists are limited in their ability to process copious amounts of information imperative for making robust decisions that simultaneously address all problem dimensions during disruption management. Therefore, there is a need to augment the decision-making capabilities of a human specialist with quantitative and qualitative tools that can rationalize complex interactions amongst all dimensions in airline disruption management, and provide objective insights to the specialists in the airline operations control center. To that effect, we provide a discussion and demonstration of an agnostic and systematic paradigm for enabling expeditious simultaneously-integrated recovery of all problem dimensions during airline disruption management, through an intelligent multi-agent system that employs principles from artificial intelligence and distributed ledger technology.


The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT

arXiv.org Artificial Intelligence

Multi-headed attention heads are a mainstay in transformer-based models. Different methods have been proposed to classify the role of each attention head based on the relations between tokens which have high pair-wise attention. These roles include syntactic (tokens with some syntactic relation), local (nearby tokens), block (tokens in the same sentence) and delimiter (the special [CLS], [SEP] tokens). There are two main challenges with existing methods for classification: (a) there are no standard scores across studies or across functional roles, and (b) these scores are often average quantities measured across sentences without capturing statistical significance. In this work, we formalize a simple yet effective score that generalizes to all the roles of attention heads and employs hypothesis testing on this score for robust inference. This provides us the right lens to systematically analyze attention heads and confidently comment on many commonly posed questions on analyzing the BERT model. In particular, we comment on the co-location of multiple functional roles in the same attention head, the distribution of attention heads across layers, and effect of fine-tuning for specific NLP tasks on these functional roles.


Are the Brain's Electromagnetic Fields the Seat of Consciousness? - Facts So Romantic

Nautilus

Christof Koch is a neuroscientist distinguished by his rock-solid scientific work and romantic yearning to understand consciousness. He recently closed an essay by wondering: "What is it about the brain, the most complex piece of active matter in the known universe, that turns its activity into the feeling of life itself?" No coincidence with that phrasing--The Feeling of Life Itself is his latest book. He argues that consciousness is produced by the brain but that it's also more widespread in nature than we might suppose. His essay described new experimental work, from Stanford neuroscientist Kieran Fox and his colleagues, that explored the effects of electrically stimulating the brain, which revealed an ordering principle.


Method to assess the functional role of noisy brain signals by mining envelope dynamics

arXiv.org Machine Learning

Data-driven spatial filtering approaches are commonly used to assess rhythmic brain activity from multichannel recordings such as electroencephalography (EEG). As spatial filter estimation is prone to noise, non-stationarity effects and limited data, a high model variability induced by slight changes of, e.g., involved hyperparameters is generally encountered. These aspects challenge the assessment of functionally relevant features which are of special importance in closed-loop applications as, e.g., in the field of rehabilitation. We propose a data-driven method to identify groups of reliable and functionally relevant oscillatory components computed by a spatial filtering approach. Therefore, we initially embrace the variability of decoding models in a large configuration space before condensing information by density-based clustering of components' functional signatures. Exemplified for a hand force task with rich within-trial structure, the approach was evaluated on EEG data of 18 healthy subjects. We found that functional characteristics of single components are revealed by distinct temporal dynamics of their event-related power changes. Based on a within-subject analysis, our clustering revealed seven groups of homogeneous envelope dynamics on average. To support introspection by practitioners, we provide a set of metrics to characterize and validate single clusterings. We show that identified clusters contain components of strictly confined frequency ranges, dominated by the alpha and beta band. Our method is applicable to any spatial filtering algorithm. Despite high model variability, it allows capturing and monitoring relevant oscillatory features. We foresee its application in closed-loop applications such as brain-computer interface based protocols in stroke rehabilitation.


Differentiating Between “Functional” and “Semantic” Roles in a High-Level Conceptual Data Modeling Language

AAAI Conferences

We discuss in this paper, from a pragmatic and operational point of view, the need of a clear differentiation between functional and semantic “roles.” In the first case, according to the linguistic and computational linguistics tradition, roles are seen as relations linking a semantic predicate to its arguments. In the second, in conformity with the ontological and Semantic Web practice, roles are equated to ordinary concepts to be inserted into a standard ontology. As we will show here, the two notions can successfully co-exist in the framework of a high level conceptual modeling language.